(12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 

A 

(19) World Intellectual Property a^t^ & ^ 



Organization 
IniemaiionaJ Bureau 

(43) International Publication Date 
21 May 2004 (21.05.2004) 




(10) Internationa] Publication Number 

PCT WO 2004/042026 A2 



(51) International Patent Classification 7 : 
(21) InleraaUonal Application Number: 



C12N 



PCT/US2003/034960 

(22) InternaUonal* Filing Date: 

4 November 2003 (04. 1 1 .2003) 



(25) Filing Language: 

(26) Publication Language: 



English 
English 



(30) Priority Data: 

60/423,556 



4 November 2002 (04. 1 1 .2002 ) US 



(71) Applicants (for all designated States except US): 
FLORIDA STATE UNIVERSITY RESEARCH 
FOUNDATION [US/US]; 2035 E. Paul Dirac Drive, 100 
Sliger Bldg., Tallahassee, FL 32310 (US). REGENTS OF 
THE UNIVERSITY OF CALIFORNIA [US/US]; 1111 
Franklin Street, 12th Floor, Oakland, CA 94607-5200 
(US). 

(72) Inventors; and 

(75) Inventors/Applicants (for US only): ROUX, Kenneth, 
H„ Ph.D. [US/US]; 3000 Valley Brook Rd., Tallahassee, 
FL 32308 (US). TEUBER, Suzanne, S., M.D. [US/US]; 
3716 Modoc Place, Davis, CA 95616 (US). SATHE, 



Shridhar, K., PhJ>. fUS/US); 6613 Man O' War Trail, 
Tallahassee, FL 32306-2400 (US). ROBOTHAM, Jason, 
M., B.S. [US/US]; 1215 Chee Lane, Tallahassee, FL 32304 
(US). 

(74) Agent: ESTEVEZ, Enrique, G.; Allen, Dyer, Doppelt, 
Milbrath & Gilchrist, P.A., Suite 1401, 255 South Orange 
Avenue, P.O. Box 3791, Orlando, FL 32802-3791 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EC, EE, ES, FI, GB, GD, GE, GH, 
GM, HR, HU, ID, TL. IN, IS, JP, KE, KG, KP, KR, KZ, LC, 
LK, LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, 
MX, MZ, NT, NO, NZ, OM, PG, PH, PL, PT, RO, RU, SC, 
SD, SE, SG, SK, SL, SY, TJ, TM, TN, TR, TT, TZ, UA, 
UG, US, UZ, VC, VN, YU, ZA, ZM, ZW. 

(84) Designated States (regional): ARIPO patent (BW, GH, 
GM, KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZM, ZW), 
Eurasian patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), 
European patent (AT, BE, BG, CH, CY, CZ, DE. DK, EE, 
ES, FI, FR, GB, GR, HU, IE, IT, LU, MC, NL, PT, RO, SE, 
SI, SK, TR), OAPI patent (BF, BJ, CF, CG, CI, CM, GA, 
GN, GQ, GW, ML, MR, NE, SN, TD, TG). 

Published: 

without international search report and to be republished 
upon receipt of that report 

[ Continued on next page J 



(54) Title: NUCLEIC ACID AND ALLERGENIC POLYPEPTIDES ENCODED THEREBY IN CASHEW NUTS (Anacardium 
occidentale) 



< 

O 

o 



O 



1 CTTTCTGTTTGCTTTTTAATTCTCTTTC^ 

81 TCGATAGGCTGGATGCCCTTGAACCCGATAACCGAGTTGAGTATGAAGCCGGTACGGTGGAAGCCTGGGATCCTAACCAT 
161 GAGCAATTCCGATGCGCTGGTGTTGCCTTGGTTAGGCATACCATCCAACCTAATGGCCTTCTCTTGCCTCAATATTCTAA 
241TGCTCCTCAACTTATTTACGTTGTCCAGGGTGAGGGTATGACAGGAATATCATATCCAGGATGCCCAGAAACTTACCAAG 
321CGCCCCAACAGGGACGACAACAGGGACAGAGTGGTAGGTTCCAGGACCGGCATCAAAAGATTCGACGC7TCCGTCGAGGC 
401 GATATCATCGCAATCCCCGCCGGAGTAGCACACTGGTGCTACAACGAGGGCAATTCCCCGGTCGTCACTGTTACTCTTCT 
461 AGACGTCTCAAACAGTCAAAATCAGCTTGATAGGACCCCACGAAAATTCCATCTGGCTGGTAACCCAAAAGATGTGTTCC 
561 AGCAGCAGCAACAACACCAATCTCGCGGGCGTAACCTTTTTTCTGGCTTCGATACAGAGTTATTGGCTGAGGCTTTCCAA 
641 GTGGACGAACGTCTCATAAAGCAGCTCAAAAGCGAGGACAACAGGGGTGGCATTGTTAAGGTGAAGGATGACGAACTTCG 
721 GGTGATCCGCCCATCAAGGAGTCAGAGCGAGCGTGGAAGTGAGAGTGAAGAGGAAAGTGAGGATGAAAAACGCCGATGGG 
801GACAGCGTGAGAATGGGATTGAAGAGACCATTTGCACTATGAGACTCAAAGAGAATATCAATGATCCTGCTCGCGCTGAC 
881 ATTTACACCCCAGMGTCGGTCGTCTTACCACACTCMC^GCCTCAACCTCCCAATCCTCAAATGGCTTCAACTCAGTGT 
961TGAAAAGGGTGTGCTATACAAAAATGCTCTAGTGCTGCCACACTGGAACCTCAACTCGCACAGCATAATATACGGGTGCA 
1 04 1 AGGGTAAAGGCCAGGTTCAAGTAGTAGACAACTTCGGCAACAGAGTGTTCGACGGCGAAGTCCGCGAGGGACAGATGTTG 
1121 GTGGTGCCACAAAACTTTGCAGTAGTGAAACGTGCMGAGAGGAMGATTCGAATGGATTTCTTTC AAGACC AATGATC^ 
1201 GGCCATGACGAGTCCTCTCGCTGGACGCACCTCGGTGCTTGGTGGCATGCCAGAGGAAGTGTTAGCCAATGCGTTCCAGA 
1 281 TCTCAAGAGAAGATGCTAGGAAGATC AAGTTCAACAATCAGCAGACAACTTTGACAAGTGGAGAGTCAAGCCACCACATGA 
1361 GG G ATG ATGCTTAA 

(57) Abstract: The invention describes an isolated nucleic acid sequence comprising the nucleotide sequence of SEQ ID NO: 1 or a 
degenerate variant of SEQ ID NO: 1 . The nucleic acid sequence encodes an Ig-E binding immunogenic polypeptide the amino acid 
sequence of which comprises at least one sequence selected from SEQ ID NOS:4-25. The invention additionally provides an in vitro 
diagnostic test for detecting anti-cashew IgE in a patient. The test comprises reacting the patienfs serum with a purified polypeptide 
the amino acid sequence of which comprises at least one sequence selected from SEQ ID NOS:3-25; separating the polypeptide from 
unreacted patient serum; reacting the polypeptide with a labeled human IgE-reactive agent after separating from unreacled patient 
serum; separating the polypeptide from unreacted labeled human IgE-reactive agent; and detecting labeled human IgE-reactive agent 
bound to the polypeptide after separating from unreacted agent to thereby indicate presence in the patient's serum of anti-cashew 
IgE. 
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NUCLEIC ACID AND ALLERGENIC 
POLYPEPTIDES ENCODED THEREBY 
IN CASHEW NUTS (Anacardium occidental) 

Related Application 

This application claims priority from co-pending U.S. provisional application 
Serial No. 60/423,556, which was filed on November 4, 2002, and which is 
incorporated herein by reference in its entirety. 

Field Of The Invention 

The present invention relates generally to the field of cell biology and, more 
particularly, to a nucleic acid sequence and the polypeptides encoded thereby which 
contain an allergen found in cashew nuts. 

Background Of The Invention 

Allergic reactions to cashew nuts (Anacardium occidentale), though relatively 
infrequent, can be varied and even. life threatening reactions include contact or 
systemic dermatitis (to cardol and anacardic acid found in the cashew nut shell 
oil)(Marks et al. 1984), atopic dermatitis, and IgE-mediated systemic allergic 
reactions.(Burks et al. 1998; Tariq et al. 1996; Garcia et al. 2000) Pistachios and 
mangos are other edible members of the Anacardiaceae family and are allergenic 
with pistachio showing extensive in vitro and possible clinical cross-reactivity with 
cashews. (Fernandez, Fiandor, Martinez-Garate, and Martinez 1995) (Parra et al. 
1993) (Quercia, Rafanelli, Marsigli, Foschi, and Stefanini 1999). Cashew nuts are 
widely used in snack foods and as an ingredient in a variety of processed foods such 
as "butters", bakery and confectionery products. 

We have recently demonstrated that the major IgE-reactive proteins in 
extracts of cashew nut are legumin-like proteins and 2S albumins as assessed by 
N-terminal and enzymatic fragment sequencing of native proteins.(Teuber, Sathe, 
Peterson, and Roux 2002) We have also constructed a cashew nut cDNA library 
and, upon initial screening, cloned and sequenced an allergen in the 7S superfamily, 
which includes vicilin-like and sucrose binding proteins. (Wang etal. 2002) Here, we 
present the sequence and characteristics of a second cDNA encoding protein, 
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designated Ana o 2, which is a member of the legumin (11S globulin) family of seed 
storage proteins. We also describe its reaction with sera from cashew-allergic 
patients and map its linear epitopes. 
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Summary Of The Invention 

With the foregoing in mind, the present invention describes a cloned, 
expressed and characterized a major cashew allergen. A cashew cDNA library was 
screened with human IgE and rabbit IgG anti-cashew extract antisera, and a reactive 
non-vicilin clone, sequenced and expressed as a fusion protein in E. coli. 
Immunoblotting was used to screen for reactivity with patient's sera and inhibition of 
immunoblottihg was used to identify the corresponding native peptides in cashew nut 
extract. The identified allergen was subjected to linear epitope mapping using the 
SPOTs solid phase synthetic peptide technology. . 

Sequence analysis showed the selected clone, designated Ana o 2, to be a 
member of the legumin family (an 11S globulin) of seed storage proteins. By 
immunoblotting, 13 of 21 (62%) of sera from cashew allergic patients were reactive. 
Immunoblot inhibition data showed the native Ana o 2 constitutes a major band at 
-33 kD and a minor band at -55 kD. Probing of overlapping synthetic peptides with 
pooled human cashew-allergic sera identified 22 reactive peptides, 7 of which gave 
strong signals. Several Ana o 2 epitopes were shown to overlap those of the peanut 
allergenic legumin, Ara h 3, in position but with little sequence similarity. Greater 
positional overlap and identity was observed between Ana o 2 and soybean glycinin 
epitopes. Conclusion: We conclude that this legumin-like protein is a major allergen 
in cashew nuts. 

Brief Description Of The Drawings 

Some of the features, advantages, and benefits of the present invention 
having been stated, others will become apparent as the description proceeds when 
taken in conjunction with the accompanying drawings in which: 
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FIG 1 shows the nucleotide and derived amino acid sequence analysis of Ana 
o 2 cDNA clone Ana o 2; this figure includes subfigures Al and 2, B, C, and D, all of 
which may be collectively referred to as FIG. 1, and wherein 

FIGS. 1 A1 and 1A2, respectively, show the nucleotide sequence 
5 encoding for the polypeptide of SEQ ID NO:1, referred to as Ana-o-2, including a 
stop codon shown underlined and in bold; and the polypeptide of SEQ ID NO:2, 
which is a tail for the polypeptide of SEQ ID NO:1 ; 

FIG. 1 B shows the derived amino acid sequence of the Ana o 2 coding 
region (SEQ ID NO:1) shown with the presumed signal sequence underlined, sites 
1 0 of sequence insertions shown in bold text, deletions shown in double underline in the 
nine most similar sequences in GenBank (see Table I) as compared to Ana o 2 
indicated; sequences similar to previously reported cashew legumin group proteins 
derived by amino acid sequencing, (Teuber 2002) are shown in bold italics; 

FIG. 1 C shows details on the sites of insertion (red) and 
15 deletions(blue) as described herein; a vertical line between residues indicates the 
sites of insertions in similar peptides with the insertion size (number of amino acids) 
and plant species (two or three letter.code) shown above the insertion site; deletions 
are underlined or overlined, and only deletions and insertions of three or more 
residues are included; abbreviations used are as follows: Rc, Ricinus communis; 
20 Cav, Corylus avellana; Qr, Quercus robur, Ahy, Amaranthus hypochondriacus] Si, 
Sesamum indicum; Ms, Magnolia salicifolia] Vs, Vicia sativa\ Cp, Cucurbita pepo; 
Car, Coffea arabica\ Ah3, Arach'is hypogaea (Ara h 3); and Ah4, Arachis hypogaea 
(Ara h 4); and 

FIG. 1 D shows a comparison of cDNA-derived Ana o 2 sequences to 
25 similar sequences of previously reported cashew legumin group proteins derived by 
amino acid sequencing. (Teuber et al. 2002) . Identical residues are in bold. 
Accession number: AF453947; 

FIG 2 shows the identification of native Ana o 2 peptide in cashew extract by 
means of Western blotting, wherein nitrocellulose blots of total cashew extract (A) 
30 and rAna o 2 (B) were probed with serum from cashew-allergic patient (#9) and 
tagged with 1251-labeled anti human IgE; the first (left) lane in each blot was 
incubated without inhibition; the second through fourth lanes were incubated with 
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rAna o 2, rAna o 1 and/or MBP as indicated; the right lanes were incubated with 
control human sera; arrows indicate locations of native Ana o 1 , top, Ana o 2 (acidic 
subunit), bottom, and an. as yet, unidentified band, middle; the lower and upper 
bands in (B) representing the uncleaved (precursor) form of rAna o 2 in its monomer 
5 and presumptive dimer configurations, respectively; and 

FIG 3 shows a comparison of the IgE-binding epitopes of various 11S 
globulins to those of the cashew legumin, Ana o 2; bracketed and numbered regions 
indicate locations of the reactive peptides in Ana o 2 (also listed in TABLE III); 
shaded segments indicate regions expressing one or more linear epitopes on the 
different allergens; identical amino acids are indicated with a "•" while similar residues 
are depicted with a the symbol 7" shows where a portion of a sequence was 
deleted and a "-" represents where a space was added in the sequence to maximize 
alignment; the presumed cleavage site between the acidic and basic subunits 
(between amino acids. 271 and 272) is shown in text above the sequence; boxes 
denote regions where numerous insertions and deletions are found when comparing 
multiple legumin sequences (see Fig 1C). 

Detailed Descripti on of the Preferred Embodiment 

The present invention will now be described more fully hereinafter with 
reference to the accompanying drawings, in which preferred embodiments of the 
invention are shown. Unless otherwise defined, all technical and scientific terms 
used herein have the same meaning as commonly understood by one of ordinary 
skill in the art to which this invention pertains. Although methods and materials 
similar or equivalent to those described herein can be used in the practice or testing 
of the present invention, suitable methods and materials are described below. All 
publications, patent applications, patents, and other references mentioned herein are 
incorporated by reference in their entirety. In case of conflict, the present 
specification, including any definitions, will control. In addition, the materials, 
methods and examples given are illustrative in nature only and not intended to be 
limiting. Accordingly, this invention may be embodied in many different forms and 
should not be construed as limited to the illustrated embodiments set forth herein. 
Moreover, these illustrated embodiments are provided so that this disclosure will be 
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thorough and complete, and will fully convey the scope of the invention to those 
skilled in the art. Other features and advantages of the invention will be apparent 
from the following detailed description, and from the claims. 

5 Abbreviations Used 

Amino acids: the standard one-letter abbreviations are used. 

DTT: dithiothreitol. 

HRP: horseradish peroxidase. 

Nucleotides: the standard one-letter abbreviations are used. 
10 PCR: polymerase chain reaction. 

PBS: phosphate buffered saline. 

DS-PAGE: sodium dodecyl sulfate polyacrylamide gel electrophoresis. 
TBS: tris-buffered saline. 

15 Methods 

Human sera. 

Blood samples were drawn after informed consent from patients with 
life-threatening systemic reactions to cashew nut and the sera frozen at -70'C until 
use. The study was approved by the human subjects review committee of the 

20 University of California at Davis. The presence of cashew-reactive IgE was 
confirmed by Pharmacia ImmunoCAP assay (Pharmacia, Uppsala, Sweden) or by 
Western immunoblotting. Control sera were obtained from patients with a history of 
anaphylaxis to walnut, pistachio or hazelnut who reported tolerance of cashews. 
Cashew protein extract 

25 An albumin/globulin extract was prepared as previously described. Protein 

concentrations were measured by use of the Bradford protein assay (BioRad 
Laboratories, Inc., Hercules, CA) using bovine serum albumin as the- standard 
protein. 

Production of rabbit polyclonal antiserum. 

30 A rabbit was immunized with 5 mg of cashew extract in Freund's complete 

adjuvant and boosted four weeks later with 5 mg of cashew extract in incomplete 
Freund's adjuvant. The rabbit was subsequently bled and the serum stored at 
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-20°C. Guidelines for animal care and welfare as in the "Guide for the Care and Use 
of Laboratory Animals" prepared by the Institute of Laboratory Animal resources, 
National Research Council, National Academy Press, revised 1996, were followed. 
Construction and IgE immunoscreening of cashew cDNA library. 
5 Library construction and immunoscreening have previously been described 

in detail.(Wang 2002) Briefly, cashew nuts in late maturation were chopped, frozen, 
and ground. Total RNA was extracted in TRIzol (GIBCO BRL Life Technologies Inc.. 
Rockville, MD) and mRNA was isolated using a PolyATtract kit (Promega, Madison, 
Wl). The cDNA library was constructed using the Uni-ZAP XR Gigapack Cloning Kit 
10 (Stratagene Inc., Cedar Creek, TX) and cloned into the lambda Uni-ZAP XR 
expression vector. The library was amplified in E. coli strain XL1-Blue and screened 
with rabbit (IgG) and human (IgE) antisera. The immunopositive clones were picked, 
plaque-purified, and stored in SM buffer supplemented with 2% chloroform at 4*C. 
Sequencing and corresponding analysis of selected genes. 
1 5 Inserts from the selected phage clones were amplified with M1 3 forward and 

reverse primers by PCR. Both strands of the PCR products were then sequenced 
on an ABI 3100 Genetic Analyzer (Foster City, CA) using capillary electrophoresis 
and Version 2 Big Dye Terminators as described by the manufacturer. Similarity 
searches and alignments of deduced amino acid sequences were performed on 
Genetics Computer Group (GCG) software (Accelrys, Inc. San Diego, CA) using the 
BLAST 2.0 program (www.ncbi.nlm.nih.gov/BLAST/). 

Cloning, expression and purification of cDNA-encoded proteins. 
As previously described in detail (Wang et al. 2002), cDNA coding sequences were 
modified by the addition of a Sal I site at 5' end and a Pst I site at 3'end by PCR 
using PfuTurbo DNA polymerase (Stratagene Inc., Cedar Creek, TX) followed by 
digestion with Sal I and Pst I and ligation to maltose binding protein (MBP) fusion 
expression vector P MAL-c2 (New England BioLabs Inc., Beverly, MA), into which a 
thrombin cleavage site had been engineered. 

For expression, competent E. coli BL21 (DE3) cells (Novagen Inc., Madison, 
Wl) were transformed. Positive clones were identified by PCR screening using 
cDNA sequence-specific primers. Single colonies were grown and induced with 
isopropyl-D-thiogalactopyranoside (IPTG). The cells were harvested, lysed with mild 
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sonication, and centrifuged at 12,000g. The lysate supernatant was passed over an 
amylose affinity column and the fusion protein eluted with 10 mM maltose. The 
cDNA insert was produced by PCR amplification of the portion of the sequence 
extending from the presumed first codon following the leader peptide through the last 
codon prior the stop codon using the primers 
TCTAGAGTCGACCGCCAGGAATGGCAAC (forward) and 
GCTTGCCTGCAGTTAAGCATCATCCCTCATG (reverse). Fusion proteins were 
cleaved with thrombin (Sigma, St. Louis, MO) and the liberated recombinant protein 
purified on a Superdex 200 column (Amersham Pharmacia, Piscataway, N J) by high 
performance liquid chromatography (HPLC). Cleaved and un-cleaved recombinant 
proteins were concentrated and either stored (briefly) at 4oC until use or frozen at 
-70oC. 

Gel electrophoresis (SDS-PAGE), immunoblotting, and inhibition. 

r Ana 0 2 (at 10 g/4mm width) or total cashew extract (100 g/4mm width) 
samples were boiled in reducing sample buffer and subjected to electrophoresis, 
immunoblotting and inhibition as previously described (Wang et al. 2002). For 
inhibition experiments, 10 g of rAna o 2, rAna o 1, and/or MBP were pre-incubated 
with human antiserum (from patient #9 at 1:20 dilution) for 1 hr at 37o C and then 
incubated with nitrocellulose strips containing blotted cashew extract overnight at 4o 
C. The strips were then incubated with the appropriate labeled second antibody and 
exposed to x-ray film. 

Solid-phase peptide (SPOTs) synthesis and binding to IgE. 
Based on the derived amino acid sequence of the 457 amino acid Ana o 2 protein 
(including the presumptive leader sequence, amino acids 1-14), 57 overlapping 
15-amino acid peptides, each offset by eight amino acids, were synthesized. An 
additional peptide, corresponding to presumptive N-terminal amino acid, 1 5-29, was 
also produced. Peptides were synthesized on derivatized cellulose sheets using 
9-fluorenlymethoxy carbonyl-derived (Fmoc) amino acids (Genosys Biotechnologies, 
Inc., The Woodlands, TX) and probed as previously described (Wang et al. 2002). 
Briefly, the peptide-containing membranes were washed in TBS and incubated o/n 
at RT in blocking solution, washed in TBS-T and incubated o/n at 4 "C with pooled 
patients' sera diluted 1 :5 (v:v). This incubation was followed by three 5-min washes 
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in TBS-T and an o/n (4 'Q incubation with 1251-anti-human IgE (Hycor Biomedical 
Inc.) diluted 1:10 in a mixture of PBS, 5% nonfat dry milk, and 0.05% Tween-20. 
After additional washes, IgE-peptide reactivity was identified after a 1 -week exposure 
at -70'C to x-ray film or a 17 hr incubation with a Molecular Dynamics 
5 phosphoimaging screen (Kodak). Phosphoimaging data was quantified using a 
Storm 860 scanner (Molecular Dynamics) and its accompanying software. 

Results 

cDNA library screening and gene characterization. 

1 0 The cDNA library was initially screened for reactivity with both rabbit IgG and 

(cashew-allergic) human IgE. As previously reported, a total of 50 clones displaying 
dual reactivity were selected for further analysis (Wang et al. 2002). The first four 
clones selected for sequencing all proved to encode one of two vicilin variants and 
were designated Ana o 1.1 and Ana o 1 .2. A further analysis of the remaining 46 

1 5 clones by PCR using Ana o 1 specific primers showed that 45 of the 46 were Ana o 
1-positive. The remaining clone, upon sequencing and comparison to GenBank, 
proved to be homologous with the legumin family of seed storage proteins (Fig 1 A) 
and was designated, Ana o 2. Table I lists the nine proteins having the highest 
degrees of identity (45% to 58%) and similarity (63% to 74%) to Ana o 2 and include 
20 proteins described as legumin-like, legumin precursor, legumin A precursor, 11S 
globulin, and 11S globulin b-subunit precursor, representing a diversity of tree and 
plant species (Table I). Also listed are four 1 1 S family proteins with somewhat lower 
identity (42%-47%) and similarity (58%-66%) but which have previously been 
identified as legumin allergens. Two of the allergens are from the peanut, Ara h 3 
25 and Ara h 4, and two from soybean, the glycinin G1 and G2 subunits. 
Protein sequence characterization. 

Analysis of the nucleotide (Fig 1A) and deduced amino acid sequences (Fig 
1 B) reveal a 457 amino acid open reading frame, a possible start site at position 93, 
and possible leader peptide from 1 to 15. Comparison of the nine most similar 
30 sequences and the four above mentioned allergens (listed in Table I) with Ana o 2 
amino acid sequence shows clustered sites of insertions (red segments) and 
deletions (blue segments). Figure 1C shows the sites of insertions (blue over and 
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underlined) and deletions (red vertical lines) of four or more amino acids in the 
eleven compared sequences with respect to the Ana o 1 sequence. 
Multiple insertions are seen between amino acid 14 and 23, 104 and 121, and 180 
and 1 98 with additional scattered insertions between 247 and 271 . In contrast to the 
24 insertion, only four deletions are present, all of which are clustered in the 245 to 
270 region. 

We have previously reported N-terminal and tryptic peptide sequences for 
several native IgE reactive cashew proteins (Teuber et al. 2002). Four of these were 
found to have similarity to the cDNA sequence of Ana o 2 (Fig 1D and green 
sequence segments in Fig 1B) though none was identical suggesting that Ana o 2 
is a member of a larger gene family or that there is extensive polymorphism at the 
relevant loci. 

Reactivity of the recombinant protein with human IgE and rabbit IgG. 

For immunological characterization, we cloned and expressed Ana o 2 
beginning at nucleotide 47 (after the presumptive leader peptide) through 1375. The 
DNA segment was ligated into an expression vector designed to allow for purification 
of the recombinant molecule via a mannose-binding protein (MBP) fusion domain in 
conjunction with an amylose affinity column and a thrombin-specific cleavage site. 
The resulting -93 kD Ana o 2 fusion protein was affinity purified as previously 
described and digested with thrombin to yield a -52 kD peptide as well as the 43 kD 
MBP (data not shown). Both cleaved and uncleaved peptides were reactive with 
specific human IgE and rabbit IgG (data not shown). 

Recognition of Ana o 2 as an allergen. 

The prevalence of reactivity to rAna o 2 among cashew allergic patients by 
Western immunoblotting is shown in Table 2. IgE from 13 of 21 (62%) sera from 
patients with a history of life-threatening reactions to cashews bound the 
recombinant. In 6 cases, the intensity of the signal was strong, but was weak in 7 
cases. In contrast, 1 of 10 sera from patients tolerant of cashew but clinically with 
life-threatening reactions to other tree nuts reacted with the rAna o 2. 

Identification of native Ana o 2 by SDS-PAGE immunoblotting. 

Having established that Ana o 2 is a major cashew allergen, we next sought 
to identify the band or bands in a typical total cashew immunoblot that correspond 
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to the cloned polypeptide storage protein precursor. Figure 2A shows nitrocellulose 
strips in which the total soluble cashew extract has been blotted and probed with 
serum from an allergic patient #9. The first strip (left lane) shows this serum reacting 
with three major bands at -55, -36 and -33 kD. The second through fourth lanes 
were incubated with the same human sera preincubated with rAna o 2, rAna o 1 + 
rAna o 2, and the fusion protein, MBP, respectively. Note that Ana o 2 inhibits the 
strong band at -33 kD and weakens the band at -55 kD. The addition of Ana o 1 
inhibits the residual signal at -55 kD. MBP has no inhibitory effect as expected. The 
remaining band at -36 kD is as yet unidentified. The last lane (right) shows no 
reactivity with a patient having allergies to agents other than cashew. 

Figure 2B shows reactivity of the human sera with rAna o 2 (-53 kD) and its 
inhibition with purified rAna o 2 but not rAna o 1 nor MBP. The band at -120 kD is 
presumed to be a dimer of rAna o 2. 

Identification and recognition of IgE-reactive linear epitopes on Ana o 2 
and comparison to peanut and soybean legumin epitopes. 
The entire amino acid length of Ana o 2 was studied by probing 58 
overlapping solid phase synthetic peptides with sera from 12 patients randomly 
assigned to three pools. Collectively, the three pools reacted weakly 
(phosphoimaging score from 2.0-3.9 x 10-3) with 12, moderately (4.0-6.9 x 10-3) 
with three, and strongly ( 7.0 x 10-3) with seven linear IgE-binding epitopes. The 22 
reactive epitopes were distributed throughout the length of the protein (Table III) with 
68% (15/22) of ail epitopes residing on the presumed acidic chain of the protein, 
including 86% (6/7) of those epitopes that were found to react strongly with pooled 
patient IgE. Only two of the identified epitopes were bound by patients' sera from all 
three pools. Epitope #6 was bound strongly by pools 1 and 3, and moderately by 
pool 2. Epitope #3 was bound strongly by pool 1, but moderately by pools 2 and 3. 

To compare the linear epitopes of cashew legumin (Ana o 2) to those of 
peanut (Ara h 3)(Rabjohn et al. 1999) and the soybean legumins (G2 and G1 
glycinin) (Helm et al. 2000b) (Beardslee etal. 2000), we aligned the sequences using 
the BLAST 2.0 program and highlighted the corresponding linear epitopes (Fig. 3). 
Of the regions included in the BLAST alignment, only two of the 22 Ana o 2 linear 
epitope-bearing peptides (#13 and 15) showed significant (>7 amino acid) positional 
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overlap with any of the four previously identified peanut Ara h 3 epitope-bearing 
peptides (Fig 3) (Rabjohn et al. 1999). Even among these two overlapping epitope 
regions, there was little identity or similarity between the Ana o 2 and Ara h 3 
(Rabjohn et al. 1999). A much greater degree of correspondence was observed 
5 when the sequences of Ana o 2 and G2 glycinin were aligned; 9 of the 11 (89%) 
previously identified G2 glycinin epitopes showed significant positional overlap with 
the Ana o 2 epitopes. All 9 showed a significant degree of similarity ( 60%) and two 
were quite similar in sequence to Ana o 2 epitopes #1 (77%) and the adjacent pair, 
#10 and #1 1 (80%) (Fig 3). Two IgE-binding linear epitopes have been identified on 

10 G1 glycinin (Beardslee et al. 2000), both on the acidic chain. Sequence alignment 
revealed that only the second glycinin epitope displayed significant positional overlap 
and no significant identity with a cashew legumin epitope (#13). Interestingly, this 
epitope (#13) is the only one identified that displays positional overlap with an 
epitope in each of the other three compared 11S globulins (Fig 3) suggesting the 

15 possibility of a shared structural motif favoring immunogenicity. 
Comparison of rAna o 2 to native peptides. 

We have recently reported the N-terminal and tryptic peptide sequences of 
four legumin proteins bands (Teuber et al. 2002). In comparing these sequences to 
those of rAna o 2, we note that none are identical in sequence (Table IV). One 

20 tryptic peptide matched at 8 of 9 positions. Two other nearly identical tryptic 
peptides matched rAna o 2 at only 13 of 19 position and 12 of 18 positions. An N- 
terminal peptide matched at 7 or 8 of 10 positions beginning at residue 272 of rAna 
o2. Together, these data suggest that these represent a complex family of 
differentially processed proteins. 

25 Discussion 

Approximately 0.5% of the US population is believed to be allergic to tree nuts 
(Sicherer et al. 1999) and the data from a voluntary registry of peanut and tree nut 
allergic US patients shows 20% of those reporting allergy to tree nuts list sensitivity 
to cashews, the highest percentage for any tree nut (Sicherer et al. 2001). In 
30 another study, restricted to pediatric patients (n=54) at Johns Hopkins (Baltimore, 
Maryland, USA), cashew reactivity ranked fourth (1 1 %) behind walnut (26%), almond 
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(13%), and, pecan (13%), perhaps reflecting a delayed dietary introduction of 
cashews relative to other nuts.(Ehrtich P. 1989) 

A number of plant, animal and fungal allergenic proteins have been identified 
by biochemical and molecular genetic means in recent years reflecting the hope that 
the detailed information provided by such studies will lead to the development of 
hypoallergenic foods, new therapeutic and diagnostic tools, and an increased 
understanding of the. properties that render some proteins more allergenic than 
others. (Heiss et al. 1999; Valenta R et aL 1999; Bannon et al. 2001; Niederberger 
et aL 2001; Bhalla et al. 2001) 

Our previous screening of a cashew nut cDNA library with human 
cashew-allergic sera primarily yielded clones encoding an IgE-reactive vicilin-like 
protein, designated Ana o1 (Wang et al. 2002). Here we describe the product of the 
single identified clone encoding a legumin-family (1 1 S globulin) seed storage protein, 
designated Ana o 2. 

The legumins are large (300-450 kD), generally hexameric, non-glycosylated 
members of a complex family of proteins. Individual subunits are found both as 
intact precursor proteins (50-60 kD) and as processed acidic (30-40 kD) and basic 
(20 kD) subunits that generally remain covalently associated in seeds. In addition 
to Ana o 2, several other legumins have been identified as food allergens. These 
include Ara h 3 (Rabjohn et al. 1 999) and Ara h 4 (Kleber-Janke et al. 1999) of the 
peanut, the soybean G1 and G2 glycinins (Zeece et al. 1999; Helm et al. 2000a; 
Beardslee et al. 2000), Cor a 9 of the hazelnut (Beyer et al. 2002), and possibly a 
legumin from buckwheat (Fujind et al. 2001 ; Yamada et al. 1 995). The legumins are 
related to each other structurally and are believed to share ancestors in common with 
vicilin-like 7S proteins. The vicilins, or 7S globulins, are also multimeric, digestion 
resistant, seed storage molecules found in many edible plants and are frequently 
allergenic. 

More than 50% of the sera from cashew allergic patients (13 of 21, 62%) 
reacted with rAna o 2 formally making this a major allergen. Six of the sera reacted 
strongly (29%). Because of the known complexity of the legumin family, and the 
differences observed between the single recombinant cashew legumin cDNA-derived 
sequence reported here and the sequenced cashew peptide fragments from 
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IgE-reactive proteins previous reported (Teuber et al. 2002) it would not be 
unexpected if additional reactivity to legumin epitopes not represented on our cloned 
and expressed version were yet to be discovered. Nevertheless, at least for the 
patient's serum tested, it appears that the recombinant protein fully inhibited 
reactivity to the native extracted protein suggesting that most or all members of the 
cashew legumin family share common allergenic epitopes. However, further 
investigations comparing additional cloned versions the legumins to each other and 
to highly purified native molecules for reactivity to human IgE will be needed to 
adequately determine if this Ana o 2 clone expresses the full range of cashew 
legumin allergenic epitopes. The unique IgE-binding patterns of each pool of sera 
to the Ana o 2 linear epitopes suggests a heterogeneity of recognition patterns 
between patients though some peptides were recognized by members of two or all 
three pools. 

Linear epitope mapping has shown that the soybean G1 glycinin acidic chain 
may share cross-reactive epitopes with the peanut Ara h 3 (Beardslee et al. 2000). 
Both epitopes mapped from the G1 acidic chain show a considerable degree of 
identity (60% and 70%) with the overlapping region of the homologous Ara h 3 
epitopes though differences were noted in the critical amino acids.(Beardslee et al. 
2000) It has been shown directly that the IgE-binding regions of the soybean G2 
glycinin also bind IgE from peanut sensitive patients [Helm 2000]. Such findings are 
not surprising when one considers that both are members of the legume family and 
patients allergic to one have serum IgE antibodies that immunologically cross-react 
with other legumes. (Eigenmann et al. 1996), While no such direct analysis has 
been performed to determine if there is cross-reactivity between our cashew legumin 
(Ana o 2) and that of peanut (Ara h 3) and soybean (G1 and G2 glycinin), the BLAST 
alignment of these protein sequences and subsequent analysis of the positional 
relationship and degree of homology between linear epitopes on each protein has 
been informative. The major findings are that there is considerable positional 
overlap and some shared sequence identify between Ana o 2 and soybean G2 
glycinin but little in common between Ana o 2 and peanut Ara h 3 linear epitopes. 
We have shown that the majority of Ana o 2 linear epitopes (68% overall and 83% 
of the most strongly reactive) are on the acidic chain of the protein. Previous studies 
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have also observed stronger reactivity with the acidic chain rather than the basic 
chain of allergenic legumins [Pederson 1989](Herian et al. 1990; Rabjohn et al. 
1999; Beardslee et al. 2000)lHelm 2000]. 

It is worth noting that the majority of Ana o 2 epitopes are located in the same 
5 areas of the protein sequence that is evolutionarily least conserved with respect to 
insertions and deletions among a variety of legumins (compare figure 2C with the 
acidic chain of Ana o 2 in figure 3); features that would further minimize 
cross-reactivity between plant legumins. It should also be borne in mind though that 
these various results represent only those reactions between a subset of legumin 
1 0 sequences in each of the compared plant seeds and the IgE from a limited set of 
patients. The inclusion of additional legumins and a broader pool of patient sera in 
future analyses could shed further light on these various relationships. 

Clinical information regarding cross-reactivity between cashews and legumes 
has not been well documented, but based on our findings we can predict that while 
1 5 there may be a basis for cross-reactivity between cashew and soybean, it is unlikely 
that such cross-reactivity exists between cashew and peanut legumins, at least with 
regard to linear epitopes. Similarly, we previously reported that cross-reactivity 
between the vicilin linear epitopes of cashew (Ana o 1) and peanut (Ara h 1) would 
also be unlikely (Wang et al. 2002), observations which, together, further explains 
20 the lack of cross-reactivity between tree nut- and peanut-reactive patients' sera 
(Sicherer et al. 2001). 

To date, no common structural characteristics of linear-lgE epitopes have 
been identified (reviewed in (Bredehorst and David 2001)), but our finding that all 
four legumins analyzed share an epitope with, positional overlap, suggests that 
epitope mapping can provide important clues to the nature of allergenicity. 
Three-dimensional modeling of our cashew legumin is continuing and when 
compared to the models of G2 glycinin (Helm et al. 2000b) and other homologous 
allergens, may provide additional insights. 
The Various Aspects of the Claimed Invention. 

The present invention thus discloses isolated nucleic acid sequences, 
polypeptide products thereof, and associated methods. The skilled will recognize 
that the isolated nucleic acids will be useful at least when expressed in a suitable cell 
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or organism to produce the encoded polypeptides, which in turn may be employed 
in testing to identify patients allergic to cashew nuts. Furthermore, expression of the 
nucleic acid sequences of the present invention in a suitable cell may be useful in 
studying and characterizing gene function. 

Accordingly, the present invention includes an isolated nucleic acid sequence 
comprising the nucleotide sequence of SEQ ID NO:1, or a degenerate variant of 
SEQ ID NO:1. SEQ ID NO:1 is the coding sequence for the described Ana-o-2 
cashew antigen, including a stop codon TAA" at positions 1372-1374. The nucleic 
acid sequence described may also comprise at least one mutation selected from a 
deletion, a substitution, and an addition. A preferred embodiment of the invention 
includes an expression vector comprising the nucleic acid sequence noted above 
operably linked to an expression control sequence, and a cultured cell comprising 
this vector, the celC or a progeny of the cell, preferably being transfected with the 
vector, wherein the cell expresses a polypeptide encoded by said nucleic acid 
sequence. The nucleic acid of SEQ ID NO:1 may also further comprise additional 
nucleotides according to SEQ ID NO:2, which provides a tail for the coding 
sequence. Additionally, the invention further includes an isolated nucleic acid 
comprising a sequence that encodes a polypeptide comprising the amino acid 
sequence of SEQ ID NO:3. An additional aspect of the invention includes a method 
of producing a protein by culturing the cell containing nucleic acid sequence SEQ ID 
NO:1 under conditions permitting expression of a polypeptide encoded by the nucleic 
acid. The method may also include purifying the polypeptide from the cell or the 
medium of the cell. 

The invention also discloses twenty-two (22) linear polypeptide epitopes, as 
listed in Table III, which are also respectively listed herein as SEQ ID NO:4 through 
SEQ ID NO:25. Accordingly, the invention includes an isolated nucleic acid 
comprising a sequence that encodes a polypeptide comprising an amino acid 
sequence selected from SEQ ID NOS:4-25. Additionally claimed is a purified Ig-E 
binding polypeptide the amino acid sequence of which comprises at least one 
sequence selected from SEQ ID NOS:4-25, and wherein the polypeptide comprises 
at least one mutation in said selected amino acid sequence, the mutation being 
selected from a deletion, a substitution, an addition, and particularly a mutation which 
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reduces the • polypeptide's IgE-binding capacity but retains at least some 
immunogenicity. Those skilled in the art should recognize that the scope of the 
invention includes an isolated nucleic acid sequence or a degenerate variant thereof, 
which encodes a polypeptide consisting essentially of an amino acid sequence 
selected from SEQ ID NOS:4-25, and that the selected amino acid sequence may 
include at least one mutation such as a deletion, a substitution, or an addition. It 
should be understood that by "essentially" it is meant that the polypeptide is largely, 
but not wholly, the specified sequence. 

Further method aspects of the disclosed invention include a test for detecting 
a cashew allergy in a patient, said test comprising contacting the patient with an 
amount of a purified polypeptide comprising at least one amino acid sequence 
selected from SEQ ID NOS:3-15. In this test, contacting preferably comprises 
introducing the purified polypeptide into the patient, for example, by injection. The 
skilled will know that injecting includes any of the known routes useful in 
immunological testing of patients, for example, intradermal^, and subcutaneously. 
Additionally, the purified polypeptide for use in such a test may comprise at least one 
mutation in said amino acid sequence, and the mutation may be selected from a 
deletion, a substitution, and an addition. 

Yet a further diagnostic test of the invention for detecting anti-cashew IgE in 
a patient to thereby indicate an allergy to cashews, the test comprising several steps. 
The patient's serum is reacted with a purified polypeptide the amino acid sequence 
of which comprises at least one sequence selected from SEQ ID NOS.4-25. The 
polypeptide is separated from unreacted patient serum, and the polypeptide is 
reacted with a labeled human IgE-reactive agent after separating from unreacted 
patient serum. The polypeptide is then separated from unreacted labeled human 
IgE-reactive agent, and the labeled human IgE-reactive agent bound to the 
polypeptide after separating from unreacted agent is detected directly or indirectly 
to thereby indicate presence in the patient's serum of anti-cashew IgE. The amino 
acid sequence of the polypeptide for use in this test may also comprise at least one 
mutation selected from a deletion, a substitution, and an addition. 

The invention also includes a test for screening patients for allergy to cashews 
which may be embodied in a test kit comprising a first reagent containing at least one 
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purified polypeptide whose amino acid sequence comprises at least one sequence 
selected from SEQ ID NOS:4-25. The test kit preferably also includes a second 
reagent containing at least one labeled human IgE-reactive agent Additionally, in 
the test kit the first reagent may include a solid phase carrying the purified 
5 polypeptide. 

The skilled will find additional guidance in carrying out the invention by 
consulting Sambrook et ah, 1989, Molecular Cloning, A Laboratory Manual, Cold 
Spring Harbor Press, N.Y., and also Ausubel et al. (eds.), 1995, Current Protocols 
in Molecular Biology (John Wiley & Sons, N.Y.). In addition, guidance in diagnostic 

10 and screening test methods suitable for use with the polypeptides of the present 
invention may be found in Manual of Clinical Laboratory Immunology, Sixth Edition, 
Rose et al. (eds.), 2002, American Society for Microbiology, Washington, D.C. As 
noted above, these publications are incorporated herein in their entirety. 

Accordingly, in the drawings and specification, there have been disclosed 

15 typical preferred embodiments of the invention, and although specific terms are 
employed, the terms are used in a descriptive sense only and not for purposes of 
limitation. The invention has been described in considerable detail with specific 
reference to these illustrated embodiments. It will be apparent, however, that various 
modifications and changes can be made within the spirit and scope of the invention 

20 as described in the foregoing specification and as defined in the appended claims. 

Sequence Listings 

Following is a written sequence listing for nucleic acid SEQ ID NOS: 1-2 and 
for polypeptides SEQ ID NOS:3-25. Also enclosed is a complete sequence listing 
25 in computer readable form. The information recorded on the form is identical to the 
written sequence listing. 
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THAT WHICH IS CLAIMED: 

1. An isolated nucleic acid sequence comprising the nucleotide sequence of 
SEQ ID NO:1 or a degenerate variant of SEQ ID N0.1 . 

2. The nucleic acid sequence of Claim 1, further comprising the nucleotide 
sequence of SEQ ID NO:2 or a degenerate variant of SEQ ID NO:2. 

3. The nucleic acid sequence of Claim 1, wherein SEQ ID NO:1 further 
comprises at least one mutation selected from a deletion, a substitution, and an 
addition. 

4. An expression vector comprising the nucleic acid sequence of Claim 1 
operably linked to an expression control sequence. 

5. A cultured cell comprising the vector of Claim 4. 

6. A cultured cell transfected with the vector of Claim 4, or a progeny of said cell, 
wherein the cell expresses a polypeptide encoded by said nucleic acid sequence. 

7 A cultured cell comprising the nucleic acid sequence of Claim 1, operably 
linked to an expression control sequence. 

8. A method of producing a protein, comprising culturing the cell of Claim 6 
under conditions permitting expression of a polypeptide encoded by said nucleic 
acid. 



9. The method of Claim 8 further comprising purifying the polypeptide from the 
cell or the medium of the cell. 
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1 0. An isolated nucleic acid comprising a sequence according to SEQ ID NO: 1 
or a degenerate variant thereof that encodes a polypeptide comprising the amino 
acid sequence of SEQ ID NO:3. 

5 1 1 . An isolated nucleic acid comprising a sequence according to SEQ ID NO: 1 
or a degenerate variant thereof that encodes a polypeptide comprising at least one 
amino acid sequence selected from SEQ ID NOS:4-25. 

12. A purified Ig-E binding immunogenic polypeptide the amino acid sequence of 
0 which comprises at least one sequence selected from SEQ ID NOS:4-25. 

1 3. The polypeptide of Claim 12, further comprising at least one mutation in said 
selected amino acid sequence. 

5 14. The polypeptide of Claim 13, wherein said at least one mutation is selected 
from a deletion, a substitution, and an addition. 

15. The polypeptide of Claim 12, further comprising at least one mutation in said 
amino acid sequence which reduces the polypeptide's IgE-binding capacity. 

16. Atestfor detecting cashew allergy in a patient, said test comprising contacting 
the patient with an amount of a purified polypeptide comprising SEQ ID NO:3. 

17. A test for detecting a cashew allergy in a patient, said test comprising 
contacting the patient with an amount of a purified polypeptide comprising at least 
one amino acid sequence selected from SEQ ID NOS:4-25. 

1 8. The test of Claim 17, wherein contacting comprises introducing the purified 
polypeptide into the patient. 

1 9. The test of Claim 1 7, wherein the purified polypeptide comprises at least one 
mutation in said amino acid sequence. 
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20. The test of Claim 1 9, wherein said at least one mutation is selected from a 
deletion, a substitution, and an addition. 

21. An in vitro diagnostic test for detecting anti-cashew IgE in a patient, said test 
comprising: 

reacting the patient's serum with a purified polypeptide the amino acid 
sequence of which comprises at least one sequence selected from SEQ ID 
NOS:3-25; 

separating the polypeptide from unreacted patient serum; 
reacting the polypeptide with a labeled human IgE-reactive agent after 
separating from unreacted patient serum; 

separating the polypeptide from unreacted labeled human IgE-reactive 
agent; and 

detecting labeled human IgE-reactive agent bound to the polypeptide 
after separating from unreacted agent to thereby indicate presence in the 
patient's serum of anti-cashew IgE. 

22. The diagnostic test of Claim 20, wherein said amino acid sequence further 
comprises at least one mutation selected from a deletion, a substitution, and an 
addition. 

23. A test kit for screening patients for allergy to cashews, said kit comprising a 
first reagent containing at least one purified polypeptide whose amino acid sequence 
comprises at least one sequence selected from SEQ ID NOS:3-25. 

24. The test kit of Claim 23, further comprising a second reagent containing at 
least one labeled human IgE-reactive agent. 

25. The test kit of Claim 23, wherein said first reagent further comprises a solid 
phase carrying the purified polypeptide. 
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26. An isolated nucleic acid sequence according to SEQ ID NO: 1 or a degenerate 
variant thereof, which encodes a polypeptide consisting essentially of an amino acid 
sequence selected from SEQ ID NOS:3-25. 

27. The nucleic acid of Claim 26, wherein said selected amino acid sequence 
further comprises at least one mutation selected from a deletion, a substitution, and 
an addition. 



WO 2004/042026 



1/7 



PCT/US2003/034960 



CD 




o g o o g y 




roo^o><oo<oe>Hb 
h<5oOo^ooi-o<^ 



o < < 



o o ^ 

hO(300h< 
J- o < o o < 
O h O h O (5 



CD CD < < 

o < o o 

OOI- 
(!) CD I- CD 
< < CD CD 




O 
< 

t 3 CD CD O p CD 
t § CD I— CD q CD 
< £ < CD CD H CD 



ll T-a)T-ojnt>tm(DsoowS OT " CNIN " 

T"" T~ T~ 



WO 2004/042026 



PO7US2003/034960 



2/7 



25 



<? < < o 
H < o o 

I— F— <C I— 
f— CD CD CD 

co 

x- 00 v- CSi 



WO 2004/042026 



3/7 



PCT/US2003/034960 



LU 
Q 
D 

> 

> 



02 
> O 



> 

UJ 
LU 
CL 



a o £ 

z o: i_ ^ < a w 
0- o q: o & > y- 
9 a a o < o cr 

LU <f 2 Q 2 e) FT 
> O O HI 2 ^ m 

P V CO CO uj o £2 



a: o co E ^ 
> uj e co a 

LU ^ 



<-CL 

< o: > 
uj o 



or x 

_l LU CO 
< LU CO 
Z £T LU 

< o 
>- or co 





0) 




0 




0 




X 




LL 




—I 




-J 




li. 


00 


OA 




CO 


CD 


— 1 


ll 





UJ O < £ ft"* P- 
o: j Lj nr 1 , 11 



^ U. W W m ?5 

- I Q w d n ^ 

V* T— T— T— T— T- 

(O (N CO ^ O CD CM 
^ t- t- CM CO CO ^ 



WO 2004/042026 PCT/US2003/034960 

4/7 

FIG. 1 C. 



Cav Ms Car Cp Cp 
5 14 4 11 5 

14 A|S|RQ|E|WQQQ|D 23 

Car Ah3,4-MsAhy Ahy Cav 

17 12-4 6 9 7 

105 Y|QAPQQGR|QQG|QSG|RF|Q 121 

Mg Cav RcQrSi V Ah3,4 Cp-Rc-Ahy 

5 21 7 8 6 7 41 9-10-7 

180 G|NPKD|VFQ|Q|Q|QQHQ|SR|GR|N 198 

Ai3-4 Q r 
3-4 7 

225 E|DNRGGIVKVK|D 236 

Ah3 Ys Ah3,4 Cav-Vs 

4 Ms 19 Cp 4 8-7 



245 S RS|OSERG SESIEEES EDEKRRWGO |RD|N 271 



Ahy Si 
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FIG. 1 D. 



cpNA segment 
fryptic fragment, 

fryptic fragment, 
Tryptic fragment, 

N-terminal fragment, 



30 LDALEPDNR38 
LDALEPDNK 

72 GLLLPQYSNAPQLIYWQG 91 
GLLVPSYNN-PELVYWQG 
GLL VPS YNN-PEL VYWQ 

272 GIEETICTMR 281 
GLEETICTMT 

M 
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FIG. 2 
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Figure 3 



Arah3: 2 - ^A%Qpyzy i !4A'FQ-%GR~'%'S'G''YS4'7' 1 A~S4 t A 1 A-E — S-L\TA 1 AR £ tzLt ™ UAi ' VALV ™ TIQP 70 

Glglyc: 7 ^^SGC-F-F^y 3 ^p..y : ..^A 'A -X-K«-G4.^T-X-MKP^-^SC-HMR 76 

G2g/yc: 6 HB^^»M-»Q)S^«^«fKW-»KP-« SC%NR 73 , 



Ana o 2; 7f /tfGLltPQySMAPQZJYWj^ 



Ara h 3: 5S A*R* ~ *V,Y*QW?W^^ 

Glglyc: 77 ^^S^ £^^^^l^ y 2 y£^^y t . ^ yw..^. t42 

G2g/yc;74 -4-/?ff>S-> ^a8BffiBSai^l s-E -XSX-'A- i<Q-P—%H~E-*A- 139 



36 



*GDlJ 136 



Ana o 2: t37 [A/P^6VAtfWCi 
/Ira /i 3: Y37 YzT-F-L 



11 



G2glyc: 140 





2*9 



* 2 » I M 

Ana o 2; *99 LFSGFDTELLAEAFQVD E^^UKQhKSi 
Ara h 3; 242 %~TP*€X~X 4}XX%N-XGS%EE* 
G1 gtyc: 220 ■B^^GM 
G2g/yc: 2f6 yzL-AP-F-K'-G.'AM'A-'A'AXN W-XMS- 




DELRVIRt S-jRS-QSERGSES 



1 



•Kp&E-EYC 306 
'j~pE--Q 1 A PQ E- 280 
•QEDD^AD- 276 



256 



a Acidic Subunit " Bas/c Subunit a 



Ana o 2: 257 E^-agC^ RVVSO-ftP^ q 
Ara n 3; 307>V r -J2*-D.-f?;%/.G ASA 



EETICTMRLKENINDPARADIYTPEVGRL TTLNS 307 
■ 1 A"GRNRSP-~N-!4A'LSKA-D 360 



G1 glyc: 281 ^••^ACKG-XV--X~~Mi^QmSP^.'AA-S 1 A-AT-346 



G2 glyc: 277 -Q/rAY- 1 ASK 1 A-SR-y 2 — 



i'A-GQN'ASP-N'/zA-S'A'AT- 336 



18 



QLS VEKGVL Yr^^LVLPHVVNLNSH^nYGCKGKGdVQ WDNFGNRVFnGF\/t?F(znM 372 



Ana o 2: 308 LNLPl 

Ara h 3: 361 — L -Yr-G-A ■Y-N» 1 A-~F 1 AA -KT->$-"«/^K-^H---"j^^-K-£.^%«/f>< 426 
G1 g/yc: 347 /gg ff S ~?"l££^:^^ — ySA-WA 406 
G2 glyc: 337 B^ttMa^flS^j ^ Hfc-Gfc 40* 



*9 



r 



L 



L 



Ana o 2: 373 L W PGNFAWKRAREERFEWtSFjCFNdR AMTSPLAGRTS VL GG MpEEVLANA FQJSR E&ARKlKFNtjGOT 442 

Ara h 3: 427 *GWj&44~Y*W*«1&+Sim^ 493 W ^ 

G1 glyc: 407 Y2 VAA WAS'AN-'AX TP-IG T—AN - 1 A -NA 'A —Y* Q*A T-N'AKSHQ- - 1 A~N~ 

G2 gtyc: 402 -'A flJWAY&AN-YJA PSIGN-AN % NA%- 



478 
"PFA 470 
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Ana-o-2 SEQS.ST25.txt 
SEQUENCE LISTING 



<110> Florida State University Research Foundation, inc. 
Regents of the University of California 
Roux, Kenneth 
Teuber, Suzanne 
Sathe, Shridahr 
Robot ham, Jason 



Snacardii1ccideStale) Al1er9eniC Pol yP e P tides Encoded hereby in Cashew Nuts 



<130> File No. 32376PCT 

<150> US 60/423,556 
<151> 2002-11-04 

* 

<160> 25 

<170> Patentln version 3.0 

<210> 1 
<211> 1374 
<212> DNA 

<213> Anacardium occi dental e 
<400> 1 

ctttctgttt gctttttaat tctctttcat ggttgcctag cttctcgcca ggaatggcaa~ 60 
caacaagatg agtgccaaat cgataggctg gatgcccttg aacccgataa ccgagttgag 120 
tatgaagccg gtacggtgga agcctgggat cctaaccatg agcaattccg atgcgctggt 180 
gttgccttgg ttaggcatac catccaacct aatggccttc tcttgcctca atattctaat 240 
gctcctcaac ttatttacgt tgtccagggt gagggtatga caggaatatc atatccagga 300 
tgcccagaaa cttaccaagc gccccaacag ggacgacaac agggacagag tggtaggttc 360 
caggaccggc atcaaaagat tcgacgcttc cgtcgaggcg atatcatcgc aatccccgcc 420 
ggagtagcac actggtgcta caacgagggc aattccccgg tcgtcactgt tactcttcta 480 
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Ana-o-2 SEQS.ST25.txt 
gacgtctcaa acagtcaaaa tcagcttgat aggaccccac gaaaattcca tctggctggt 540 

aacccaaaag atgtgttcca gcagcagcaa caacaccaat ctcgcgggcg taaccttttt 600 

tctggcttcg atacagagtt attggctgag gctttccaag tggacgaacg tctcataaag 660 

cagctcaaaa gcgaggacaa caggggtggc attgttaagg tgaaggatga cgaacttcgg 720 

gtgatccgcc catcaaggag tcagagcgag cgtggaagtg agagtgaaga ggaaagtgag 780 

gatgaaaaac gccgatgggg acagcgtgac aatgggattg aagagaccat ttgcactatg 840 

agactcaaag agaatatcaa tgatcctgct cgcgctgaca tttacacccc agaagtcggt 900 

cgtcttacca cactcaacag cctcaacctc ccaatcctca aatggcttca actcagtgtt 960 

gaaaagggtg tgctatacaa aaatgctcta gtgctgccac actggaacct caactcgcac 1020 

agcataatat acgggtgcaa gggtaaaggc caggttcaag tagtagacaa cttcggcaac 1080 

agagtgttcg acggcgaagt ccgcgaggga cagatgttgg tggtgccaca aaactttgca 1140 

gtagtgaaac gtgcaagaga ggaaagattc gaatggattt ctttcaagac caatgatcgg 1200 

gccatgacga gtcctctcgc tggacgcacc tcggtgcttg gtggcatgcc agaggaagtg 1260 

ttagccaatg cgttccagat ctcaagagaa gatgctagga agatcaagtt caacaatcag 1320 

cagacaactt tgacaagtgg agagtcaagc caccacatga gggatgatgc ttaa 1374 

<210> 2 
<211> 296 
<212> DNA 

<213> Anacardium occi dental e 
<400> 2 

attttaagta atttgagctg agctagtggt gatttaaagc cgaatgcatg tggtgtacgt 60 
actatgtttt ttgttttgct ttgtaagggg gataggtaat gaataataaa ggagagcttg 120 
gatagtctct gctgtgagag gggagaagaa agcagggagc agagagcaga gagcttgtat 180 
gtagttaagt taatattact actactacta ctacgatgtg aatgaactct tgatgagttc 240 
tgtccaataa aaaactactt ttcctactca aaaaaaaaaa aaaaaaaaaa aaaaaa 296 

<210> 3 

<211> 457 

<212> prt 

<213> Anacardium occi dental e 

<400> 3 

Leu ser Val cys Phe Leu lie Leu Phe His Gly Cys Leu Ala Ser Arq 
1 5 10 15 

Page 2 
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Ana-o-2 SEQS.ST25.txt 
Gin Glu Trp Gin Gin Gin Asp Glu Cys Gin lie Asp Arg Leu Asp Ala 
20 25 30 

Leu Glu Pro Asp Asn Arg val Glu Tyr Glu Ala Gly Thr val Glu Ala 
35 40 45 

Trp Asp Pro Asn His Glu Gin Phe Arg cys Ala Gly Val Ala Leu Val 
50 55 60 

Arg His Thr lie Gin Pro Asn Gly Leu Leu Leu Pro Gin Tyr ser Asn 
65 70 75 80 

Ala Pro Gin Leu lie Tyr Val Val Gin Gly Glu Gly Met Thr Gly He 
85 90 95 

Ser Tyr Pro Gly Cys Pro Glu Thr Tyr Gin Ala Pro Gin Gin Gly Arq 
100 105 110 

Gin Gin Gly Gin Ser Gly Arg Phe Gin Asp Arg His Gin Lys He Arq 
115 120 125 

Arg phe Arg Arg Gly Asp lie He Ala He Pro Ala Gly Val Ala His 
130 135 140 

Trp Cys Tyr Asn Glu Gly Asn Ser Pro Val val Thr val Thr Leu Leu 
145 150 155 160 

Asp val Ser Asn Ser Gin Asn Gin Leu Asp Arg Thr Pro Arq Lys Phe 
165 170 175 

His Leu Ala Gly Asn Pro Lys Asp val Phe Gin Gin Gin Gin Gin His 
180 185 190 

Gin Ser Arg Gly Arg Asn Leu Phe Ser Gly Phe Asp Thr Glu Leu Leu 
195 200 205 

Ala Glu Ala Phe Gin val Asp Glu Arg Leu lie Lys Gin Leu lvs ser 
210 215 ~ 220 

Glu Asp Asn Arg Gly Gly lie val Lys Val Lys Asp Asp Glu Leu Arq 
225 230 235 240 

val lie Arg Pro Ser Arg ser Gin Ser Glu Arg Gly Ser Glu ser Glu 
245 250 255 

Glu Glu Ser Glu Asp Glu Lys Arg Arg Trp Gly Gin Arg Asp Asn Gly 
260 265 270 

lie Glu Glu Thr lie Cys Thr Met Arg Leu Lys Glu Asn He Asn Asp 
275 280 285 

Pro Ala Arg Ala Asp lie Tyr Thr Pro Glu val Gly Arg Leu Thr Thr 
290 295 300 

Leu Asn Ser Leu Asn Leu Pro lie Leu Lys Trp Leu Gin Leu Ser val 
305 310 315 320 

Glu Lys Gly val Leu Tyr Lys Asn Ala Leu val Leu Pro His Trp Asn 
325 330 335 

Leu Asn Ser His Ser He He Tyr Gly Cys Lys Gly Lys Gly Gin Val 
340 345 350 

Gin Val val Asp Asn Phe Gly Asn Arg val Phe Asp Gly Glu val Arq 
355 360 365 

Glu Gly Gin Met Leu Val val Pro Gin Asn Phe Ala val val Lys Arq 
370 375 380 
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Ana-o-2 SEQS.ST25.txt 

Ala Arg Glu Glu Arg Phe Glu Trp He Ser Phe Lys Thr Asn Asp Arg 
385 390 395 400 

Ala Met Thr Ser Pro Leu Ala Gly Arg Thr Ser val Leu Gly Gly Met 
405 410 415 

Pro Glu Glu Val Leu Ala Asn Ala Phe Gin lie Ser Arg Glu Asp Ala 
420 425 430 

Arg Lys lie Lys Phe Asn Asn Gin Gin Thr Thr Leu Thr Ser Gly Glu 
435 440 445 

ser Ser His His Met Arg Asp Asp Ala 
450 455 

<210> 4 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 
<400> 4 

Leu Ser Val Cys Phe Leu lie Leu Phe His Gly Cys Leu Ala Ser 
15 10 15 

<210> 5 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 5 

Phe His Gly Cys Leu Ala Ser Arg Gin Glu Trp Gin Gin Gin Asp 
1 5 10 15 

<210> 6 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 6 

ser Arg Gin Glu Trp Gin Gin Gin Asp Glu Cys Gin lie Asp Arg 
1 5 10 15 

<210> 7 
<211> 15 
<212> PRT 

<213> Anacardium occi dental e 

Page 4 



WO. 2004/042026 



Ana-o-2 SEQS.ST25.txt 
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<400> 7 

Gin Glu Trp Gin Gin Gin Asp Glu Cys Gin lie Asp Arg Leu asd 
1 5 2.0 15 

<210> 8 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 8 

Gin Gly Glu Gly Met Thr Gly He Ser Tyr Pro Gly Cys Pro Glu 
1 5 10 ^ 15 

<210> 9 

<211> 15 

<212> prt 

<213> Anacardium occi dental e 

<400> 9 

Tyr Gin Ala Pro Gin Gin Gly Arg Gin Gin Gly Gin Ser Gly Arq 
1 5 10 15 

<210> 10 

<211> 15 

<212> prt 

<213> Anacardium occi dental e 



<400> 10 

Gin Gin Gly Gin Ser Gly Arg Phe Gin Asp Arg His Gin Lys lie 
1 5 10 15 

<210> 11 
<211> 15 
<212>* PRT 

<213> Anacardium occi dental e 
<400> 11 

Gin Asp Arg His Gin Lys He Arg Arg phe Arg Arg Gly Asp He 
1 5 10 15 
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<210> 12 Ana-o-2 SEQS.ST25.txt 

<211> 15 
<212> PRT 

<213> Anacardium occi dental e 
<400> 12 

Ala He Pro Ala Gly Val Ala His Trp Cys Tyr Asn Glu Gly Asn 
15 10 15 

<210> 13 
<211> 15 
<212> PRT 

<213> Anacardium occidentale 
<400> 13 

Leu Asp Arg Thr Pro Arg Lys Phe His Leu Ala Gly Asn Pro Lys 
15 10 15 

<210> 14 
<211> 15 
<212> PRT 

<213> Anacardium occidentale 
<400> 14 

Val Phe Gin Gin Gin Gin Gin His Gin Ser Arg Gly Arg Asn Leu 
15 10 15 

<210> 15 

<211> 15 

<212> PRT 

<213> Anacardium occidentale 

<400> 15 

Arg Leu He Lys Gin Leu Lys Ser Glu Asp Asn Arg Gly Gly He 
1 5 10 15 

<210> 16 
<211> 15 
<212> PRT 

<213> Anacardium occidentale 
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Ana-o-2 SEQS.ST25.txt 

<400> 16 

Lys Val Lys Asp Asp Glu Leu Arg val He Arg Pro Ser Arg ser 
1 5 10 15 

<210> 17 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 
<400> 17 

Val lie Arg Pro Ser Arg Ser Gin Ser Glu Arg Gly Ser Glu Ser 
15 10 15 

<210> 18 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 18 

Glu Glu Ser Glu Asp Glu Lys Arg Arg Trp Gly Gin Arg Asp Asn 
1 5 10 15 

<210> 19 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 
<400> 19 

Leu Lys Trp Leu Gin Leu Ser Val Glu Lys Gly Val Leu Tyr Lvs 
1 5 io if 

<210> 20 

<211> 15 

<212> PRT 

<213> Anacardium occidental e 

0 

<400> 20 

Ala Leu Val Leu Pro His Trp Asn Leu Asn ser His ser lie lie 
1 5 10 15 

<210> 21 
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Ana-o-2 SEQS.ST25.txt 

<211> 15 
<212> PRT 

<213> Anacardium occi dental e 
<400> 21 

Leu Asn ser His Ser He lie Tyr Gly cys Lys Gly Lys Gly Gin 
1 5 10 15 

<210> 22 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e ^ 
<400> 22 

Gin Asn Phe Ala val Val Lys Arg Ala Arg Glu Glu Arq Phe Glu 
1 5 10 15 

<210> 23 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 23 

Ala Arg Glu Glu Arg Phe Glu Trp He Ser Phe Lys Thr Asn Asp 
1 5 io is H 

<210> 24 

<211> 15 

<212> PRT 

<213> Anacardium occi dental e 

<400> 24 

Pro Glu Glu Val Leu Ala Asn Ala Phe Gin He Ser Arg Glu Asp 

1 5 io i5 H 

<210> 25 

<211> 15 

<212> prt 

<213> Anacardium occi dental e 
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Ana-o-2 SEQS.ST25.txt 



<400> 25 



Phe Gin He Ser Arg Glu Asp Ala Arg Lys He Lys Phe Asn Asn 
15 10 15 
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